Large-scale document image retrieval and classification with runlength histograms and binary embeddings

نویسندگان

Albert Gordo

Florent Perronnin

Ernest Valveny

چکیده

We present a new document image descriptor based on multi-scale runlength histograms. This descriptor does not rely on layout analysis and can be computed efficiently. We show how this descriptor can achieve state-of-theart results on two very different public datasets in classification and retrieval tasks. Moreover, we show how we can compress and binarize these descriptors to make them suitable for large-scale applications. We can achieve state-ofthe-art results in classification using binary descriptors of as few as 16 to 64 bits.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Local Primitive Histograms for Patent Binary Image Retrieval

Local primitives are useful in the analysis, recognition and retrieval of document and patent images. In this paper, local primitives are classified in 4 and 8-directional spaces at optimally detected junction and end points by using a distance based approach. Local primitives are quantized by using a variant of Local Binary Patterns. Spatial relationships between local primitives are establish...

متن کامل

Document Analysis And Classification Based On Passing Window

In this paper we present Document analysis and classification system to segment and classify contents of Arabic document images. This system includes preprocessing, document segmentation, feature extraction and document classification. A document image is enhanced in the preprocessing by removing noise, binarization, and detecting and correcting image skew. In document segmentation, an algorith...

متن کامل

Learning Document Image Features With SqueezeNet Convolutional Neural Network

The classification of various document images is considered an important step towards building a modern digital library or office automation system. Convolutional Neural Network (CNN) classifiers trained with backpropagation are considered to be the current state of the art model for this task. However, there are two major drawbacks for these classifiers: the huge computational power demand for...

متن کامل

Proximity-based Graph Embeddings for Multi-label Classification

In many real applications of text mining, information retrieval and natural language processing, large-scale features are frequently used, which often make the employed machine learning algorithms intractable, leading to the well-known problem “curse of dimensionality”. Aiming at not only removing the redundant information from the original features but also improving their discriminating abili...

متن کامل

Document Image Retrieval Based on Keyword Spotting Using Relevance Feedback

Keyword Spotting is a well-known method in document image retrieval. In this method, Search in document images is based on query word image. In this Paper, an approach for document image retrieval based on keyword spotting has been proposed. In proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method is used in this paper to imp...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

Pattern Recognition

دوره 46 شماره

صفحات -

تاریخ انتشار 2013

Large-scale document image retrieval and classification with runlength histograms and binary embeddings

نویسندگان

چکیده

منابع مشابه

Local Primitive Histograms for Patent Binary Image Retrieval

Document Analysis And Classification Based On Passing Window

Learning Document Image Features With SqueezeNet Convolutional Neural Network

Proximity-based Graph Embeddings for Multi-label Classification

Document Image Retrieval Based on Keyword Spotting Using Relevance Feedback

عنوان ژورنال:

اشتراک گذاری